Accession Number |
TCMCG024C33014 |
gbkey |
CDS |
Protein Id |
XP_022010337.1 |
Location |
join(172434061..172434817,172435806..172435902,172436061..172436163,172436263..172436316,172436396..172436470,172436611..172436679,172436790..172436873,172436969..172437157,172437247..172437746,172437838..172437967,172438058..172438149,172438233..172438411,172438509..172438711,172438782..172438883,172438961..172439123,172439203..172439492,172439577..172439714,172439891..172439970,172440140..172440239,172440328..172440457,172441129..172441352) |
Gene |
LOC110909917 |
GeneID |
110909917 |
Organism |
Helianthus annuus |
|
|
Length |
1252 aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA396063 |
db_source |
XM_022154645.2 |
|
Definition |
DNA mismatch repair protein MSH6 [Helianthus annuus] |
CDS: ATGGCATTCCGTCGTCCAGCCAACGGCCGGTCGCCGCTCGTCAATCCCCAACGCCAAATCACCTCTTTCTTCTCCAAATCACCTTCTTCAACATCCTCTCATCCTCCTTCTCACTCACCATCCCCTATTTCTAACTCCAAACCTAACCCTAACCCTAACCCTAAACCTACTACAACTCCGTCGCCTCTACAAACCAAAAGCGCTAACAAACCGCCCTTAGTTATCGGCAATTCTCCTTCAACTCCTGCTTCCGGTGCCTCAAACCCTACCTACGGCGATGAAGTAGTTAACCGAAGAATTAGGGTTTACTGGCCTCTCGACAAGTGTTGGTACGAAGGTTGTGTAAAATCGTTCGATAAGAGTTCCGGTAAGCATTTGGTACAGTATGATGATGCTGAGGAGGAGCGTTTGGATTTATCTAAAGAGAAGATTGAGCTGTTGAAGGAGCAGGTTAAGAAGTTAAAACGGTTGAGGCGGGTTTCTGTCGAAGAGGATGAAGATGATGAGGCTGCAGGTGGTGTGGAAAGTGGTGGAGATGATTCGGCGGATGAGGATTGGGGGAAGAGTGTTGAGAAAGAAGTTGTTGATGATGAAATGGAGGATTTAGGGTTGGTGGATGAGGAAGAAGAAGAAGAAGTTGTTAAGGAAGTGAAGCAGGATCTTAAAAGGCGAAAGGTGTCCGGGACGAAATCGGATTCGGTTAAGAAGATAAAGACTGAATCTCCAAAGATTTTGAGTCCTAAAGTTAATAATAATTGTGGGAAGGCCACTATTATTGCTGACAATGTTCCAGTGGGCGATATGGCTGATAGATATGCCGCAAGAGAAGAAGAGAAATTCAGATTTCTTGGAAAGAACCGAAAGGATGCAAATAAGAGGTCCCCTGGAGATGAAAATTATGATCCAAGAACTCTATACATGCCTCCAGAATTCCTGAGGAGTTTAACAGGTGGCCAGAGGCAATGGTGGGAGTTCAAGTCAAAACATATGGATAAGGTTTTATTTTTTAAGATGGGAAAGTTTTATGAGCTCTTTGAAATGGATGCACATGTTGGGGCAAAAGAACTTGATCTGCAGTATATGAAGGGAGACCAACCGCATTGTGGATTCCCAGAAAAGAATTATGAAGTGAATGCAGAGAAGTTAGCTCGCAAGGGTTATCGTGTTCTAGTTGTTGAGCAGACAGAAACACCTGATCAGGCTGAGAAACGTCGCAAACAAGAAGGTTCTAAAGACAAGGTTGTGAAACGTGAAATATGTGCAGTGATCACCAAAGGGACATTGACTGACGGAGAAATGCTGTCAACCAATCCTGATGCTTCTTACCTGTTTGCAGTTGCTGAATGCTATGATGAAAACCAACAAGATGATAGAATATATGGTGTTTGTGTAGTTGATATTGCTACAAGCAAGATCATCATTGGACAGTTTAGTGATGATTCAGAATGCAGTGTGTTGAGCTGTCTATTGTCTCAATTAAGACCAGTGGAAATCATTAAACCTAAAAGAACGCTCAGCCCTGAAACCGAAAGAGTACTTTTGAGACAAACAAGAAGTCCCGTGATAAATGAATTAGTACCGGTTGAAGAGTTTTGGGATTCTGAAAAAACTATTCAAGAAATCAAGAAGATTTATCAACGTATCAGTAACCAATCGCAATCTGATAGTAAAGATTACCTACCAGAAATTCTCTCTGAGCTAATGACCGAAGGCAAAATTGGTAGTTTTGCACTCTCAGCCCTTGGTGGAACTTTATTTTATCTGAAAAAGGCCTTTTTGGATGAGTCATTGCTTCGGTTTGCAAAGTTTGAGCTACTTCCATGTTCTGGTTTTGCTGATGTCACCACAAAACCCTATATGATTCTTGATGCAACTGCTTTAGAGAATCTTGAAGTTTTTGAGAACAGTGTAAACGGGGACTCTAAAGGGACATTGTATGACCAACTAAACCGTTGTGTGACACCATTTGGGAAGAGATTGCTTAAAGCATGGCTT
GCTAGACCGTTATATGACATAAACTCGATCAGAGAACGCCAAGAAGCTGTAGCTGGTGTTAAGGGAGCTAATCTGCCTCTTGCTCTTGAATTTCGTAAAGATTTGTCATTATTACCAGACATGGAACGGTTGCTTGCACGTATCTTTTCTTGCAGTGAAGCTAATGGTAGAAATTCAAGTAAAGTTATTCTCTATGAGGATGCAGCAAGGAAACAACTTCAAGAGTTTATAATAGTTCTCAGTGGCTGTGATGTACTTATAAATGCATGTGCTTCACTAGGTGCCATTCTGGAAAATACCGACTCTAAGCTGCTGCATCGCCTTTTAACACCTGGTAAAGATCATTCAGATGTTAATGCAGCTCTTAAGCATTTCAAAAACGCTTTTGATTGGATGGAAGCAAAAAGTTCAGGCCGTATAATTCCTCGGGATGGGGTTGATAATGAGTATGACTCCGCATGCAGAACTGTTACAGATATCGAATTTAGTTTGAAAAATCACTTAAAGGAACAGAGAAAACTACTTGGAGACTCATCTATCAATTATGTTACTGTTGGAAAAGACTCGTATCTTCTTGAAGTACCTGAAAGCTTATCAGGGAGTCTGCCTAGTGATTACGAACGACAATCATCCAAAAAGGGTGTCGCTCGTTATTGGACTCCTGCTATTAAGAAATATATTCGGGAGCTCTCAGAAGCTGAATCTGAGAAAGAGTCAAAGCTAAAAAGCATCATGCAGAGGCTAATCGGGCGCTTCTGTGAGCATCATGTTAGCTGGAGACAGTTAATTTCTAAGGCTGCAGAACTCGATGTCCTGATCAGCATAGCAATTGCAAGCGACCTCTATGAAGGACCAACATGTCGTCCACTTATTGTGGATTCATCAGTTGAAAACGAATCACCGTTTCTTGTTGCTAATAGTTTAGGTCATCCGATACTGAGAAACGATACTTTAGGGGATGGCACTTTTGTCCCAAATGATGTTTCTATAGGTGGCTCTAATAAGGCCAGATTTATCCTACTTACTGGTCCTAACATGGGTGGAAAGTCAACTCTTCTTCGCCAAGTTTGCTTAGCTGTTATTTTGGCTCAGGTGGGTGCTGATGTACCTGCAGAAAGCTTTAAGATGTCTCCAGTTGATCGTATTTTTGTGAGGATGGGTGCAAAAGACCATATTATGGCGGGCCAGAGTACATTTCTAACTGAGCTTCTGGAAACTGCATCTATGCTGTCATCAGCAACCCATAATTCACTTGTGGCATTAGATGAACTTGGACGGGGGACAGCTACTTCGGATGGACAAGCCATAGCTGCATCAGTTCTTGAGCACCTTGTCAACAAGGTCCAATGTAGGGGGTTATTTTCTACTCACTATCATCACTTAGCTTTGGACTATCAGCAAATTCCCAAGGTTTCGTTGTGTCACATGGCATGCAAAGTTGGAAAGGAGTTGGGAGGTCTAGAGGAGGTTACGTTTCTCTACAAATTGACACCTGGCGCATGCCCTAAAAGCTACGGTGTCAATGTTGCACGGCTAGCAGGACTTCCTGATGACGTGCTTAAAAAAGCCACAATCAAGTCAGAAGAATTTGAGACGATGTATGGTAACAAAAGAAACCAGTCCAACTGCACCAATATGACACCTGAGATTACGGTCTTCTTTCAGAGCTTAAGAACTTGTCTTGTGGATGCACGTTGCGGCTCTTCACCCCACGACATATACAAGCTACAACATAGGGCAAAGACACTTCTGGAGCAAAAGTAG |
Protein: MAFRRPANGRSPLVNPQRQITSFFSKSPSSTSSHPPSHSPSPISNSKPNPNPNPKPTTTPSPLQTKSANKPPLVIGNSPSTPASGASNPTYGDEVVNRRIRVYWPLDKCWYEGCVKSFDKSSGKHLVQYDDAEEERLDLSKEKIELLKEQVKKLKRLRRVSVEEDEDDEAAGGVESGGDDSADEDWGKSVEKEVVDDEMEDLGLVDEEEEEEVVKEVKQDLKRRKVSGTKSDSVKKIKTESPKILSPKVNNNCGKATIIADNVPVGDMADRYAAREEEKFRFLGKNRKDANKRSPGDENYDPRTLYMPPEFLRSLTGGQRQWWEFKSKHMDKVLFFKMGKFYELFEMDAHVGAKELDLQYMKGDQPHCGFPEKNYEVNAEKLARKGYRVLVVEQTETPDQAEKRRKQEGSKDKVVKREICAVITKGTLTDGEMLSTNPDASYLFAVAECYDENQQDDRIYGVCVVDIATSKIIIGQFSDDSECSVLSCLLSQLRPVEIIKPKRTLSPETERVLLRQTRSPVINELVPVEEFWDSEKTIQEIKKIYQRISNQSQSDSKDYLPEILSELMTEGKIGSFALSALGGTLFYLKKAFLDESLLRFAKFELLPCSGFADVTTKPYMILDATALENLEVFENSVNGDSKGTLYDQLNRCVTPFGKRLLKAWLARPLYDINSIRERQEAVAGVKGANLPLALEFRKDLSLLPDMERLLARIFSCSEANGRNSSKVILYEDAARKQLQEFIIVLSGCDVLINACASLGAILENTDSKLLHRLLTPGKDHSDVNAALKHFKNAFDWMEAKSSGRIIPRDGVDNEYDSACRTVTDIEFSLKNHLKEQRKLLGDSSINYVTVGKDSYLLEVPESLSGSLPSDYERQSSKKGVARYWTPAIKKYIRELSEAESEKESKLKSIMQRLIGRFCEHHVSWRQLISKAAELDVLISIAIASDLYEGPTCRPLIVDSSVENESPFLVANSLGHPILRNDTLGDGTFVPNDVSIGGSNKARFILLTGPNMGGKSTLLRQVCLAVILAQVGADVPAESFKMSPVDRIFVRMGAKDHIMAGQSTFLTELLETASMLSSATHNSLVALDELGRGTATSDGQAIAASVLEHLVNKVQCRGLFSTHYHHLALDYQQIPKVSLCHMACKVGKELGGLEEVTFLYKLTPGACPKSYGVNVARLAGLPDDVLKKATIKSEEFETMYGNKRNQSNCTNMTPEITVFFQSLRTCLVDARCGSSPHDIYKLQHRAKTLLEQK |
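
The Location field above uses the GenBank `join(start..end,...)` notation, where each pair is a 1-based, inclusive exon interval on the genomic sequence. As a consistency check, the summed exon lengths should equal three nucleotides per residue of the 1252 aa protein plus one stop codon. The sketch below is a minimal illustration under that assumption; the helper name `parse_join` is hypothetical, and the regular expression only handles simple forward-strand `join(...)` strings like the one in this record (no `complement(...)`, partial-end markers, or remote references).

```python
import re

# Location string copied verbatim from the record's "Location" field.
LOCATION = (
    "join(172434061..172434817,172435806..172435902,172436061..172436163,"
    "172436263..172436316,172436396..172436470,172436611..172436679,"
    "172436790..172436873,172436969..172437157,172437247..172437746,"
    "172437838..172437967,172438058..172438149,172438233..172438411,"
    "172438509..172438711,172438782..172438883,172438961..172439123,"
    "172439203..172439492,172439577..172439714,172439891..172439970,"
    "172440140..172440239,172440328..172440457,172441129..172441352)"
)

# Protein length taken from the record's "Length" field.
PROTEIN_LENGTH_AA = 1252


def parse_join(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs, 1-based inclusive, from a simple join(...) string.

    Hypothetical helper for illustration only: it ignores complement(),
    partial-end markers (< and >), and remote accessions.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


segments = parse_join(LOCATION)

# Intervals are inclusive, so each exon spans end - start + 1 bases.
cds_length = sum(end - start + 1 for start, end in segments)

# Expected coding length: 3 nt per residue plus one 3-nt stop codon.
expected_length = 3 * (PROTEIN_LENGTH_AA + 1)

print(f"exons: {len(segments)}")
print(f"CDS length: {cds_length} nt (expected {expected_length} nt)")
print("consistent" if cds_length == expected_length else "MISMATCH")
```

For this record the 21 intervals sum to 3759 nt, which matches 3 x (1252 + 1), so the Location, Length, CDS, and Protein fields agree with one another on length.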